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Introduction and Motiviation 


The guitar pedal has been used since the 1950’s to amplify, modulate, delay, 
and otherwise modify the sound of a guitar to create interesting sound 
effects. Today, there are literally hundreds of guitar pedals on the market 
that can do compression, wah-wah, overdrive, chorus, flanger, phase-shift, 
time-delay, and reverberation sound effects. In particular, there is a class of 
effects in which the pitch is shifted in order to create intervals. However, 
many of the guitar pedals currently available can only make the same 
interval for each pitch--even though music theory tells us that in a key, not 
all chords are harmonized the same way. Thus, we designed a pre-amp that 
will harmonize a note depending on the chosen key. Surprisingly, of the 
many guitar pedals available, we could not find a guitar pedal on the market 
that could harmonize a note at all, much less harmonize based on different 
keys. Soon, we would learn why this guitar pedal does not exist. 


Knowing that there is always a first time for everything, we were never 
phased. Currently, guitar pedals which change the tone can only do so in 
fixed intervals--which allows for incomplete harmonization. We wanted to 
design a guitar pedal which could harmonize different notes in the key with 
a different interval--which allows for better harmonization. 


Our original strategy was as follows: first, we sampled at 44.1 kHz, a 
frequency greater than the Nyquist frequency and the standard for sampling 
music. Then, we employed the FFT to analyze the signal in the frequency 
domain. Third, we identified the pitch. Fourth, we modulated the pitch by a 
major third or a minor third depending on the pitch. Fifth, we combined the 
modified signal and the original signal to create the harmonized signal. 
Last, we took the IFFT and sent the new signal to the amplifier. 


Like many engineering projects, the project did not turn out to be quite as 
easy as our projection. Reliably identifying the note being played in real- 
time is difficult, as is playing a harmonized note that doesn’t sound choppy 
yet changes when a new note is played. Read on to see if we found the key. 


History of Guitar Pedals 


Technology is amazing. For example, when I wake up in the morning, I 
reach for my smartphone and turn-on some music to start my day. And yes, 
those are the opening lyrics to “More than a Feeling” by Boston, you can 
put away your smartphone now. 


Barry Goudreau, Tom Scholz, Sib Hashian, Brad Delp, Fran Sheehan, in 
1977. 


But what you may not know is that Tom Scholz, the founding member of 
Boston and its primary songwriter, attended MIT and actually invented a 
guitar pedal to use. In doing so, Scholz joined the long tradition of 
musicians discovering ways to change their sound to make theirs sounds 
different, unique, and interesting. Witness the vast number of trumpet 
mutes: straight, cup plunger, bucket, harmon stem in/stem out. But because 
the electric guitar needs electronics to even make a sound, guitarists 
realized that by changing the electronics, they could create new and 


interesting sounds. In the 1950’s and 60’s, guitarists changed the electronics 
by actually physically modifying (breaking) the hardware by using razor 
blades to cut up the cones in the amplifiers to make the guitar sound 
“gritty.” Soon, guitarists soon realized that instead of breaking the current 
technology, they could simply create ones to deliberately change the sound 
--and the guitar pedal was born. 


In doing so, Scholz joined the long tradition of musicians discovering ways 
to change their sound to make theirs sounds different, unique, and 
interesting. Witness the vast number of trumpet mutes: straight, cup 
plunger, bucket, harmon stem in/stem out. But because the electric guitar 
needs electronics to even make a sound, guitarists realized that by changing 
the electronics, they could create new and interesting sounds. Guitarists 
soon realized that they could simply create new electronics to deliberately 
change the sound --and the guitar pedal was born. 


There is a “signal chain” where the electrical guitar is connected to the 
“guitar pedal” which is then connected to the large amplifier. The effects 
can be split into three main areas: volume and dynamic, modulation, and 
time-based effects. Dynamic effects include those of compression (makes 
dynamic range of guitar smaller), wah-wah (makes sound louder-then 
softer-then-louder, like the word wah-wah), and overdrive (distorts the 
sound by adding harmonics and inharmonics, originally made by making 
the vacuum tube larger). Modulation effects can include a flanger (adds the 
time-delayed output to itself in various intervals, resulting in a comb-filter 
in frequency domain), phase shifters (change the phase in frequency 
domain). Time-based effects include delay (play sound after fixed period), 
and reverberation (make a reverberation sound by making delays of a sound 
in smaller and smaller amplitude). 


Currently, the Eventide “Pitchfactor” is one of the current leaders in pitch- 

modification. One of the cool effects that the “Pitchfactor” can do is output 
harmonized notes at a specific delay. Using this device, a guitar player can 

harmonize with himself and even make chords. However, the “Pitchfactor” 
is $500, so our group tried to see if we could make a similar effect at much 
cheaper cost. 


Citation: Wikipedia, "Effects Units" 


Music Theory 


In music, an octave is two notes such that one is double the fundamental 
frequency of the other, where we call both notes the same. We use letters 
between A through G, to denote notes. Within an octave, there are twelve 
semitones, otherwise known as half-steps. Western music theory defines a 
key within a given octave, with the first half of the name defining how the 
scale is determined and the second half defining what is the tonic, or the 
initial note we wish to start our scale on. For our project, we began first 
with Major C, which begins on C and proceeds in the following pattern 
with “W” representing a whole step, or 2 semitones, and “H” representing a 
semitone: W-W-H-W-W-W-H. Major scales are the easiest to play on piano 
since it consists primarily of the white keys (C-D-E-F-G-A-B-C ...) and are 
the most common in popular music. 


The next progression in making music is to play more than one note at a 
time. When you play two notes, you create an interval between the two 
notes. We name intervals based on the difference between the letters (i.e. C 
and G are 5ths if counting up from C) as well as the amount of semitones 
between them. The more important intervals for creating chords are a minor 
3rd, which has 3 semitones, and a major 3rd, which has 4 semitones. 


A chord is a collection of three or more notes played all at the same time. 
For our project, we decided to start with Major chords since they are the 
easiest to hear and pleasant to most people. A major chord can also be 
called a triad, since it consists of three notes. If within the key of C, a C 
major chord will consist of C, E, and G. The intervals between C and E is a 
major third and the interval between E and G is a minor third. However, this 
logic cannot be applied to all triads within C major. For example, an E 
chord in the key of C will consist of E-G-B. The interval between E and G 
is still a minor third but the interval between G and B is a major third, 
creating a Minor E chord within the key of C. Chords within the key of C 
fall into the following three categories: Major third-Minor Third (C-E-G, F- 
A-C, and G-B-D), Minor Third-Major Third (D-F-A,E-G-B, and A-C-E), 
and Minor Third-Minor Third (B-D-F). You can generalize this by simply 
changing the tonic and therefore shifting the names of the notes according 
to the scale. Therefore, if you wish to create a chord within a given key, you 


must identify the note currently being played and the key you wish to play 
in. 


Mathematically, you can divide an octave up into 12 equal parts. If you take 
the ratio of two notes an octave apart, you will get a ratio of 2 (or 2 to some 
power). To have an equidistant scale, you take the logarithm, base 2, of the 
ratio. If you want 12 equal parts, multiply the ratio times 12 and you will an 
equal division. Since a musician’s ear is picky and we want more accuracy, 
you typically times this by 100 to allow for slightly sharp or slightly flat 
notes to be created. This is equal to “n” cents. Cents is a term used by my 
musicians to describe tuning. This method can be generalized to find the 
cents between any two notes, with frequency f1 and f2 and log being of log, 
base 2: 


n = 1200 * loon 


2 


For our project, n*100 denotes the amount of semitones we wish to g, f2 
denotes the frequency of the key (in the correct octave, it must be below the 
note we are playing) we are currently in, f1 denotes the frequency of the 
note currently being played. When you apply the formula in this way, you 
can identify the note being played based on the tonic you prescribe. 
Knowing this formula, you can now do the reverse where you know the 
tonic and know how many semitones we wish to move up to compute the 
frequency of the note and solve for f1. 


Theory and Process 


In theory, the process we planned to use to create a harmonizing guitar 
pedal is very linear and straightforward. First we lowpass filtered the output 
signal of the guitar, then sampled it in order to get the signal in a digital 
form. With the digital signal, we could then lowpass filter it even farther to 
get rid of some high frequency noise, this time using an FIR filter. We then 
took the discrete Fourier transform (DFT) of the signal, so that we could 
analyze the frequency content of the signal. We identified the note being 
played by the guitarist, calculated the harmonics of this note, if any existed 
in the chosen key, and added dirac deltas at the harmonic frequencies to the 
original guitar signal to produce the out output chord. We then took the 
inverse DFT of this new signal, and output the time domain version to the 
guitar amplifier. through this process, if the original note played on the 
guitar was in the correct key, the signal that actually reached the amplifier 
would be a chord with that note as the base, still in the original key. 


The first step in the process is sampling the analog, continuous time signal 
produced by the guitar and converting it into a sampled discrete time digital 
signal that we can process on a computer. The minimum sampling rate for a 
signal is two times the highest frequency component of the signal. Because 
of this fact, before we could sample the guitar output we had to run it 
through a lowpass filter such that it became band limited. We planned on 
sampling the signal at a rate of 44.1 kH, and therefore we needed to ensure 
that any and all of the frequency components of the original signal above 
22.5 kHz were reduced to zero by the lowpass filter. Once the filtering is 
done, it is trivial matter of running the signal through an analog to digital 
converter (ADC) which samples at the rate we desire, 44.1 kHz. The output 
of the ADC can then be sent directly into our computer for processing. In 
our actual procedure, we used a device that did both the filtering and ADC 
sampling for us, as discussed below. 


Once we have the digital signal in our computer, we begin to do the actual 
signal processing. FIrst, we once again low pass filter a copy of signal, this 
time at a lower frequency, for example 8 kHz, simply so that we could do 
our signal processing without worrying about high frequency noise. We 
retain the original guitar signal so that when we alter the output we can add 


our harmonic tones to the original signal. Then, if the magnitude of the 
input at the current time suggests that a note has just been played, we begin 
our frequency domain processing. We begin by using the fast fourier 
transform algorithm (FFT) to compute the DFT coefficients of our signal. 
It’s important that we choose a fairly short (~1/20th of a second) sample 
length from the signal to compute the DFT, as we only want one note to be 
present in the output of the guitar during the period of the output that we are 
analyzing. Once we have obtained the frequency domain representation of 
our input signal, we find the frequency component with the largest 
magnitude. We identify this frequency as the note which the guitar player 
played. Given this note and the key in which the guitarist is playing, we are 
able to calculate the correct harmonic frequencies, if there are any, as 
discussed above. We then increase the magnitudes of these harmonic 
frequencies to a magnitude similar to that of the core, “note” frequency. 
This gives us the frequency representation of our output signal. Finally, we 
take the inverse FFT of this signal. This gives us the time domain 
representation of our final output, which we then pass through a digital to 
analog converter (DAC) and on to our amplifier, which produces the signal 
as if it had come straight from the guitar. This process sounds very simple 
and straightforward in theory. However, in our actual implementation we 
ran into several problems and complications which showcase the difference 
between the theory and reality of digital signal processing. 


Implementation 


High Level Design 


The high level design of our project is as follows. We have a Behringer 
UCA222 that serves as the interface between the pickup in the guitar and 
the computer. The final output is played on the speaker that is connected to 
the UCA via a 3.5mm audio jack. The UCA222 takes the guitar input 
through dual channel RCA ports and forwards the signal to the computer 
through USB. We then do computation and analysis, in Python, on the 
sample signal to determine the frequency and the identity of the note being 
played, and depending on the note synthesis the correct harmony. The 
synthesized signal is fed back into the UCA222 and outputted on the 


speaker. 
Behringer 
UCA222 Computer 


Figure 1: Block Diagram of our System 


Figure 2: Picture of our Setup 


Sampling and Frequency Analysis 


We sampled the analog signal of the pickup through the UCA222. The 
UCA222 is programmed to anti-alias and sample and forward the data into 
system audio. We can then access system audio via a Python library called 
PyAudio. PyAudio is able to give us vectors of the samples of a desired 
length. Compared the difference in latency to our system between different 
vector length and decide that 1024 was a good middle ground. We decided 
to further lowpass filter the sampled chunks to reduce high frequency noise 
introduced by the ‘fret-buzz’ and the initial attack of the guitar string. 


import numpy 
import pyaudio 
import scipy 
import math 


# constants 

fs = 44100.0 

CHUNK = 1024 

cutoff_hz = 8000.0 

nyq_rate = fs / 2. # for normalizing 
width = 1000.0/nyq_rate 

ripple_db = 30.0 


# open pyaudio stream 

p = pyaudio.PyAudio( ) 

stream = p.open(format=pyaudio.paFloat32, 
channels=1, rate=44100, 
input=True, 
output=True, 
frames_per_buffer=CHUNK ) 


# build filter 

N, beta = scipy.signal.kaiserord(ripple_db, width) 
taps = scipy.signal.firwin(N, cutoff_hz/nyq_rate, 
window=('kaiser', beta) ) 

delay = 0.5 * (N-1) / fs 


while(True): 

data = stream.read(CHUNK) 

xX = numpy.fromstring(data, 
dtype=numpy.float32) 

x_filtered = scipy.signal.lfilter(taps, 1.0, 
x) 

X = numpy.fft.fft(x_filtered) 


stream.close() 
p.terminate() 


From this, we are able to analysis each frame of 1024 samples and compute 
the frequency content of the frames. We played a few different notes and 
plotted the frequency content of notes in hopes of figuring out a good way 
to differentiate between notes. Below are plots of a note being struck in the 
time domain, and frequency spectrum plots of a high and low C note. 


Figure 3: Time Domain Plot of a note being played 


Figure 4: Frequency Spectrum of a High and Low C 


Note Identification 


After looking at the frequency domain, it is obvious that each note has one 
distinct peak at a particular frequency. So we decided to do note detection 
by finding the frequency with the largest magnitude. In other words, we are 
doing max-peak detection. In doing this, we took the absolute value of the 
frequency bins in the positive half of the spectrum and use the argmax 
function from the NumPy library obtain the frequency bin with the highest 
peak. However, in using this method we very quickly encountered a 
problem. When a certain note is played on the guitar, though the profile of 
the note in the frequency domain will not change very much, the maximum 
peak in fact moves around a little. Because other approach only looks at the 
highest peak, sometimes we will get errors when the neighboring peaks 
overtake the correct main peak in magnitude. When this happens our 
program will think the note is quickly changing, but actually it is just the 
same note reverberating through the guitar. 


The solution we came up with to alleviate this problem is to average what 
the program thinks is the highest frequencies across multiple frames of data. 


We tested out different average lengths and found that averaging every 25 
frames sounded the best. Our code will keep track of a variable-length list 
that contains the frequency of the highest peak per frame. At the start of 
every frame a new frequency value will be added to the list, and at the end 
of every frame the list is averaged to determine a single frequency. This list 
resets every 25 frames. In midst of this process, we also converted to linear 
frequency from FFT bins, which is done by multiplying by the ratio of the 
sampling frequency over half the chunk length. To further improve our 
results, we only added to the list frequencies if the magnitude of that 
particular frequency bin is above a particular value. This way our code will 
ignore the low amplitude noise and only record frequencies when a note is 
being played. 


flag = 0 
freq = [] 
while(True): 


data = stream.read(CHUNK) 

xX = numpy.fromstring(data, 
dtype=numpy.float32) 

x_filtered = scipy.signal.lfilter(taps, 1.0, 
x) 

X = numpy.fft.fft(x_filtered) 

highest_freq = numpy.argmax(abs(X[0:511])) 


flag +=1 

if (X[highest_freq] > 5000): 
freq.append(highest_freq*44100/(CHUNK/2) ) 

avg = (sum(freq)/len(freq) ) 


if flag == 25: 
flag 0 
freq [ | 


Creating the Harmony 


Once we have the correct frequency and the identity of the note, we proceed 
to figure out the frequency of the harmonizing note. We first create a 
Python dictionary of musical keys mapped to a list of the corresponding 
frequencies of that note at different octaves. For purposes of demonstration, 
we only added the key of C to the dictionary. However, other keys could 
easily be added. We then created a function that helps on determine which 
harmonic we are in given a musical key and the frequency of a note. We 
then apply the formula for finding the harmonizing frequencies describe in 
the previous section to the frequency we have and obtain the frequency of 
the third and fifth intervals from the note being played. 


keys 
sBuf 


{'c':[1636, 1636/2, 1636/4, 1636/8]} 
50 


def key_select(freg, key): 
for f in keys[key]: 
if freq >= f: 
return f 
return freq 


def chord_freqs(key, note): 
n = 1200*numpy.log2(note/key ) 
if (500-sBuf<= n <=500+SBuf or 700-SBuf<= 
n<=700+sBuf or n<=sBuf): 
f3 = note * math.pow(2, (400)/1200) 
f5 = note * math.pow(2, (700)/1200) 
return f3 , f5 
elif (200-sBuf<n<200+sBuf or 400- 
sBuf<n<400+sBuf or 900-sBuf<n<900+SBuf ): 
f3 = note * math.pow(2, (300)/1200) 
f5 = note * math.pow(2, (700)/1200) 
return f3, f5 


elif (1100-sBuf<n<1100+sBuf ): 
F3 note * math.pow(2, (300)/1200) 
5 note * math.pow(2, (600)/1200) 


return f3, f5 
else: 
return 0,0 


After obtaining the two harmony frequencies, we tried to take them into the 
time domain. We initially tried add triangles of frequency content centered 
at the desired frequencies and then taking the inverse transform, in hope to 
try to mimic a "real" sound. However, doing that caused the note to sound 
slightly off. Also we didn't add imaginary components to the frequency and 
the FFT library gave us warnings. So we decided to try out other 
approaches. The next thing we thought of was using the Karplus-Strong 
Plucked String algorithm to create a realistic decaying note. But we very 
quickly saw that the algorithm was not really meant to be used in real time. 
The algorithm works by creating the complete decaying signal. However, 
this is problem with other stream because we can only output 1024 samples 
at a time, which is not long enough to squeeze the whole decay in. We 
thought about storing the whole signal from the Plucked String algorithm 
and output it a frame at a time. But doing so not only slow down our 
system, but also would introduce problems if we played a different note half 
way through a output decaying note. We ended up deciding to just output 
pure sine waves and worry about making it sound less synthesized in the 
future. 


def play_tone(stream, frequency=252, Length=1, 
rate=44100): 

chunks = [] 

f3 , f5 = chord_freqs(key_select(frequency, 
'c'), frequency) 

chunks.append(sine(f3, length, rate)+sine(f5, 
length, rate) ) 


chunk = numpy.concatenate(chunks) * 0.25 


stream.write(chunk.astype(numpy.float32).tostring( 
), CHUNK) 


def sine(frequency, length, rate): 

length = int(length * rate) 

factor float(frequency) * (numpy.pi * 2) / 
rate 

return numpy.sin(numpy.arange(length) * 
factor ) 


Now that we were only outputting pure sine waves, we decided that it 
would be faster and save computation complexity but just creating the sine 
waves of the correct frequencies in the time domain directly. This would 
Save computation time as now we don't need to compute an inverse FFT. To 
put it all together. We output the harmonies through the PyAudio stream. 
The UCA222 takes this and adds it to the guitar signal that it is already 
forwarding to the speakers. 


Conclusion 


In the end, our team were mostly successful in our original plan to 
harmonize each note in real-time. We could correctly identify the note being 
played, identify the correct octave, calculate the frequencies of the notes 
required to complete the triad, and then output the harmonized result. In 
practice though, there remain challenges that limit the practical use of the 
device. I will list these challenges in order of difficulty and importance. 


First, a small delay was necessary to allow the guitar sound itself to 
stabilize. If this delay is not present, the harmonized frequency change very 
quickly, resulting in an unsettling sound effect. To solve this problem, a 
more complex note identification program is necessary. This program 
would need to (1) ignore the initial “pluck” of the string and (2) use 
knowledge of the harmonics to more accurately predict the note being 
played. However, note that this delay still results in a fully-functioning 
affect--in fact, one of the effects of the Eventide’s Harmonizer is a delayed 
harmony (a.k.a. what we implemented). The only difference is that the 
Eventide’s max allowable speed is probably higher than our maximum 
speed. 


Second, our “harmonized” notes were pure sine waves at the specified 
frequency, which sounds like a pure tone as opposed to a guitar sound. 
Using a “plucked-string” algorithm and/or other synthesizer techniques, we 
could make the harmonized notes sound as if they were played by different 
instruments such as a guitar, violins, trumpets, tubas, harp, organ, or piano. 
This functionality would greatly improve the appeal of the device. 


Third, we could add the “pedal” hardware in order to give the user the 
ability to turn the harmonized effect on or off while he’s playing. 


Fourth, due to the exponential nature of frequency (to reach the next octave, 
the frequency doubles), note resolution was much better at higher 
frequencies. This problem could be solved by upsampling or sampling for a 
longer period of time, either of which would result in higher resolution in 
the frequency domain. 


Fifth, we only had two “presets” for how device would harmonize each key. 
For the key of C, the first harmonizes each note to stay within the key of C 
Major. These chords are thus C, F, G major, D, E, A minor, and B 
diminished. The second preset was harmonized so that each note stays 
within has C minor, which is made of the chords of C,D, F, G minor, Eb, 
Ab, and Bb major. Neither preset harmonizes notes not in the key. However, 
we can trivially extend our code to add presets of more types of codes such 
as Dominant, Major, Minor, Augmented, Diminished, Seventh, Ninth, ... 
We could even allow for a manual mode in which the user can specify a 
specific interval for each of the 12 notes in the octave. 


In conclusion, we were able to create a pre-amplifier that created an 
interesting harmonizaiton sound effect--especially if you like alien noises. 
The link below is of our poster. 


We would like to thank Richard Baraniuk, Jeff Lievense and Nancy Jia for 
being T.A.'s for 301. 


https://drive.google.com/file/d/OB4uCk1m20E-JT05mc19tcO5NbjQ/view? 
usp=sharing 


